# Multi-turn Dialogue Support

Qwen3 8B Q4 K M GGUF

This is the GGUF format version of the Qwen3-8B model, suitable for the llama.cpp framework and supporting text generation tasks.

Large Language Model

Granite 3.3 8b Instruct Q8 0 GGUF

This model is a GGUF format model converted from the IBM Granite-3.3-8B instruction fine-tuned model, suitable for text generation tasks.

Large Language Model

Gemma 2 2b It Tool Think

A text generation model fine-tuned from google/gemma-2b-it, with support for a tool-calling reasoning process.

Large Language Model

Qwen2.5 0.5B Instruct

A 0.5B-parameter instruction fine-tuned model designed for the Gensyn reinforcement learning group, supporting local fine-tuning.

Large Language Model

Transformers English

Qwen2.5-14B-Instruct is a 14 billion parameter instruction fine-tuned large language model based on the Qwen2.5 architecture, optimized on the s1K dataset.

Large Language Model

Orpheus 3b 0.1 Ft Q6 K GGUF

This is a GGUF format model converted from canopylabs/orpheus-3b-0.1-ft, suitable for text-to-speech tasks.

Large Language Model English

Gemma 3 12b It Q5 K S GGUF

This is the GGUF quantized version of Google's Gemma 3 12B IT model, suitable for local inference and supporting text generation tasks.

Large Language Model

Gemma 3 27b It Q4 K M GGUF

This model is a GGUF format version converted from Google's Gemma 3 27B IT model, suitable for local inference.

Large Language Model

paultimothymooney

Llama Joycaption Alpha Two Hf Llava FP8 Dynamic

This is an FP8 compressed version of the Llama JoyCaption Alpha Two model developed by fancyfeast, implemented using the llm-compressor tool and compatible with the vllm framework.

Image-to-Text English

Deepseek R1 Distill Llama 8B GGUF

DeepSeek-R1-Distill-Llama-8B is an 8B-parameter reasoning model based on the Llama architecture, using 1.58-bit + 2-bit dynamic quantization intended to preserve accuracy.

Large Language Model English

Internlm3 8b Instruct Gguf

The GGUF format version of the InternLM3-8B-Instruct model, suitable for the llama.cpp framework and supporting multiple quantization versions.

Large Language Model English

Tanuki 8B Dpo V1.0

Tanuki-8B is an 8B-parameter Japanese large language model developed by GENIAC Matsuo Lab and optimized for dialogue tasks through SFT and DPO.

Large Language Model

Transformers Supports Multiple Languages

Mistral 7B Banking V2

A banking-specific large language model fine-tuned from Mistral-7B, focusing on banking transactions and customer support scenarios.

Large Language Model

Dolphinhermespro ModelStock

This model is a hybrid created by merging the Dolphin-2.8 and Hermes-2-Pro 7B-parameter models using the LazyMerge toolkit, based on the Mistral-7B architecture.

Large Language Model

Minicpm MoE 8x2B

MiniCPM-MoE-8x2B is a Transformer-based Mixture of Experts (MoE) language model, designed with 8 expert modules where each token activates 2 experts for processing.

Large Language Model
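
As a rough illustration of the routing described for MiniCPM-MoE-8x2B (8 experts, 2 active per token), here is a minimal Python sketch of generic top-2 gating. The names `top2_route` and `moe_forward`, the toy experts, and the choice to renormalize softmax weights over only the two selected experts are assumptions made for illustration; this is not the model's actual implementation.

```python
import math


def top2_route(router_logits):
    """Return [(expert_index, weight), ...] for the 2 highest-scoring experts.

    Weights are a softmax over the two selected logits only (an assumption;
    real models may normalize differently).
    """
    if len(router_logits) < 2:
        raise ValueError("need at least 2 experts")
    # Indices of the two largest logits (ties keep the lower index first).
    top2 = sorted(range(len(router_logits)),
                  key=lambda i: router_logits[i], reverse=True)[:2]
    # Numerically stable softmax restricted to the selected experts.
    peak = max(router_logits[i] for i in top2)
    exps = [math.exp(router_logits[i] - peak) for i in top2]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top2, exps)]


def moe_forward(x, experts, router_logits):
    """Combine the outputs of the two routed experts for one token."""
    return sum(w * experts[i](x) for i, w in top2_route(router_logits))


# Toy setup: 8 scalar "experts", each multiplying its input by its index.
experts = [lambda x, k=k: x * k for k in range(8)]
logits = [0.1, 2.0, -1.0, 0.5, 2.0, 0.0, 0.3, -0.2]
routes = top2_route(logits)
output = moe_forward(2.0, experts, logits)
print(routes, output)
```

Only two of the eight experts run for a given token, which is why a model of this kind can hold many more parameters than it uses per forward pass.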

Mistral 7B OpenOrca Q4 K M GGUF

This model is a GGUF format model converted from Open-Orca/Mistral-7B-OpenOrca, suitable for text generation tasks.

Large Language Model English

Sciphi Mistral 7B 32k

A large language model fine-tuned from Mistral-7B-v0.1, focused on enhancing scientific reasoning and educational capabilities.

Large Language Model

Codellama 13b Oasst Sft V10

A version fine-tuned by Open-Assistant based on Meta's CodeLlama 13B large language model, supporting English, with a new RoPE Theta value (1e6 instead of 1e4).

Large Language Model

Transformers English
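
To make the RoPE theta change mentioned for the CodeLlama fine-tune (1e6 instead of 1e4) more concrete, the sketch below computes the standard rotary inverse frequencies, theta^(-2i/d), for both values. The function name `rope_inv_freq` is hypothetical and the head dimension of 128 is an assumed value, not taken from the model card.

```python
def rope_inv_freq(head_dim, theta):
    """Standard RoPE inverse frequencies: theta ** (-2i / head_dim) per dimension pair i."""
    return [theta ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]


HEAD_DIM = 128  # assumed head dimension, for illustration only

old = rope_inv_freq(HEAD_DIM, 1e4)  # original base
new = rope_inv_freq(HEAD_DIM, 1e6)  # larger base from the description above

# The fastest-rotating pair is identical for both bases;
# the slowest-rotating pair turns more slowly with the larger base.
print("fastest pair:", old[0], new[0])
print("slowest pair:", old[-1], new[-1])
```

A larger base stretches the wavelengths of the low-frequency dimensions, which is commonly done to make positions distinguishable over longer contexts.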

Vicuna is a chat assistant fine-tuned from Llama 2, trained on user-shared dialogues from ShareGPT.

Large Language Model

This is a German language model based on the GPT-2 architecture, specifically optimized for German text generation tasks.

Large Language Model German

anonymous-german-nlp

Distilbert Base Squad2 Custom Dataset

A model based on Distilbert_Base and fine-tuned on SQuAD2.0 and custom Q&A datasets, focusing on efficient Q&A tasks.

Question Answering System

The Russian version of GPT-2 is a text generation model developed based on OpenAI's GPT-2 architecture, specifically optimized and trained for Russian text.

Large Language Model

Transformers Other

© 2025 AIbase